RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: [illegible]

Date Processed by STIC:

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS.
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-308-4221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 4.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS:

http://www.uspto.gov/web/offices/pac/checker/chkr41note.htm



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there
is a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.
Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission
User Manual - ePAVE)

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450

3. Hand Carry directly to (EFFECTIVE 12/01/03):

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202




Revised 10/08/03 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1   Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
    was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
    prevent "wrapping."

2   Invalid Line Length
    The rules require that a line not exceed 72 characters in length. This includes white spaces.

3   Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
    use space characters instead.
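The line-length and tab rules above lend themselves to a quick check before submission. The following is a minimal sketch, not the Checker program itself; the function name `check_lines` and the message wording are assumptions made for illustration.

```python
def check_lines(text, max_len=72):
    """Return (line_number, problem) pairs for two of the formatting rules.

    Hypothetical helper: flags lines longer than max_len characters
    (white space included) and lines containing tab characters.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > max_len:
            problems.append((number, "line exceeds %d characters" % max_len))
        if "\t" in line:
            problems.append((number, "tab character found; use spaces"))
    return problems


if __name__ == "__main__":
    sample = "atg gcc\n" + "a" * 73 + "\n1\t5\n"
    for number, problem in check_lines(sample):
        print(number, problem)
```

A file that passes this check may still fail other rules; it only covers line length and tab usage.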



4   Non-ASCII
    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
    ensure your subsequent submission is saved in ASCII text.

5   Variable Length
    Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
    each n or Xaa can only represent a single residue. Please present the maximum number of each
    residue having variable length and indicate in the <220>-<223> section that some may be missing.

6   PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
    sequence(s) ____. Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
    the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
    Artificial or Unknown sequences.

7   Skipped Sequences (OLD RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

        (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
        (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
        This sequence is intentionally skipped

    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8   Skipped Sequences (NEW RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

        <210> sequence id number
        <400> sequence id number
        000


9   Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's has been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In the <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10  Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
    scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
    is Artificial Sequence.
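The <213> rule can be approximated programmatically. The sketch below is only a heuristic under stated assumptions: the function name `check_213` is hypothetical, and the regular expression merely tests for a capitalized genus followed by a lowercase species word, which is not how the Office validates scientific names.

```python
import re

# Heuristic only: "Genus species" with a capitalized first word.
SCIENTIFIC_NAME = re.compile(r"^[A-Z][a-z]+ [a-z]+")


def check_213(seq_id, organism, has_220_223):
    """Return a list of problems for one sequence's <213> response."""
    problems = []
    special = organism in ("Unknown", "Artificial Sequence")
    if not special and not SCIENTIFIC_NAME.match(organism):
        problems.append("SEQ ID NO %d: invalid <213> response %r" % (seq_id, organism))
    if special and not has_220_223:
        problems.append("SEQ ID NO %d: <220>-<223> required for %r" % (seq_id, organism))
    return problems


if __name__ == "__main__":
    print(check_213(11, "Mus musculus", True))
    print(check_213(6, "Other", True))
    print(check_213(1, "Artificial Sequence", False))
```

With the responses that appear in this listing, "Mus musculus" and "Artificial Sequence" pass, while a response of "Other" is reported as invalid.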



11  Use of <220>
    Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
    "Unknown." Please explain source of genetic material in <220> to <223> section.
    (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  PatentIn 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
    resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
    listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13  Misuse of n/Xaa
    "n" can only represent a single nucleotide; "Xaa" can only represent a single amino acid.



AMC - Biotechnology Systems Branch - 09/09/2003 
RAW SEQUENCE LISTING DATE: 03/04/2004

PATENT APPLICATION: US/10/030,658A TIME: 09:44:17

Input Set : A:\2004-02-25 4456-0101P.ST25.txt
Output Set: N:\CRF4\03042004\J030658A.raw

3 <110> APPLICANT: Yamamura Ken-ichi 

4 Araki Kimi 

6 <120> TITLE OF INVENTION: TRAP VECTORS AND GENE TRAPPING USING THE SAME 
8 <130> FILE REFERENCE: 4456-0101P

10 <140> CURRENT APPLICATION NUMBER: 10/030,658A

11 <141> CURRENT FILING DATE: 2002-01-11 

13 <150> PRIOR APPLICATION NUMBER: JP99/200997 

14 <151> PRIOR FILING DATE: 1999-07-14 
16 <160> NUMBER OF SEQ ID NOS: 17

18 <170> SOFTWARE: PatentIn Ver. 2.0

20 <210> SEQ ID NO: 1

21 <211> LENGTH: 13

22 <212> TYPE: DNA

23 <213> ORGANISM: Artificial Sequence
25 <220> FEATURE:

26 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA

28 <400> SEQUENCE: 1

29 taccgttcgt ata

31 <210> SEQ ID NO: 2 

32 <211> LENGTH: 13

33 <212> TYPE: DNA

34 <213> ORGANISM: Artificial Sequence
36 <220> FEATURE:

37 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA

39 <400> SEQUENCE: 2

40 tatacgaacg gta

42 <210> SEQ ID NO: 3 

43 <211> LENGTH: 34 
44 <212> TYPE: DNA

45 <213> ORGANISM: Artificial Sequence 

47 <220> FEATURE: 

48 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA 

50 <400> SEQUENCE: 3

51 ataacttcgt atagcataca ttatacgaag ttat 34 

53 <210> SEQ ID NO: 4 

54 <211> LENGTH: 13 

55 <212> TYPE: DNA 

56 <213> ORGANISM: Artificial Sequence 
58 <220> FEATURE: 

59 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA

61 <400> SEQUENCE: 4

62 ataacttcgt ata 

64 <210> SEQ ID NO: 5 

65 <211> LENGTH: 13 

66 <212> TYPE: DNA 

67 <213> ORGANISM: Artificial Sequence 
69 <220> FEATURE: 

70 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA

72 <400> SEQUENCE: 5

73 tatacgaagt tat 

75 <210> SEQ ID NO: 6

76 <211> LENGTH: 34

77 <212> TYPE: DNA

78 <213> ORGANISM: Other

80 <220> FEATURE:

81 <223> OTHER INFORMATION: Homologous recombination sequence

83 <400> SEQUENCE: 6 

84 taccgttcgt atagcataca ttatacgaac ggta 34 

86 <210> SEQ ID NO: 7 

87 <211> LENGTH: 19 

88 <212> TYPE: DNA 

89 <213> ORGANISM: Artificial Sequence 

91 <220> FEATURE:

92 <223> OTHER INFORMATION: Z1 forward primer used in PCR for B-geo detection

94 <400> SEQUENCE: 7

95 gcgttaccca acttaatcg 19

97 <210> SEQ ID NO: 8 

98 <211> LENGTH: 18 

99 <212> TYPE: DNA 

100 <213> ORGANISM: Artificial Sequence 

102 <220> FEATURE: 

103 <223> OTHER INFORMATION: Z2 reverse primer used in PCR for B-geo detection
105 <400> SEQUENCE: 8

106 tgtgagcgag taacaacc 18

108 <210> SEQ ID NO: 9 

109 <211> LENGTH: 22 

110 <212> TYPE: DNA 

111 <213> ORGANISM: Artificial Sequence 

113 <220> FEATURE: 

114 <223> OTHER INFORMATION: Ori2 forward primer used in PCR for detecting the replication

115 origin region in pUC vector

117 <400> SEQUENCE: 9 

118 gccagtggcg ataagtcgtg tc 22 

120 <210> SEQ ID NO: 10 

121 <211> LENGTH: 21 

122 <212> TYPE: DNA 

123 <213> ORGANISM: Artificial Sequence 

125 <220> FEATURE: 

126 <223> OTHER INFORMATION: Ori3 reverse primer used in PCR for detecting the replication

127 origin region in pUC vector

129 <400> SEQUENCE: 10 

130 cacagaatca ggggataacg c 21 

132 <210> SEQ ID NO: 11 

133 <211> LENGTH: 400 

134 <212> TYPE: DNA 

135 <213> ORGANISM: Mus musculus 

137 <220> FEATURE: 

138 <221> NAME/KEY: misc_feature
139 <222> LOCATION: (36)..(36)
140 <223> OTHER INFORMATION: n is a, c, g, or t

142 <220> FEATURE:
143 <221> NAME/KEY: misc_feature
144 <222> LOCATION: (70)..(70)
145 <223> OTHER INFORMATION: n is a, c, g, or t

147 <220> FEATURE:
148 <221> NAME/KEY: misc_feature
149 <222> LOCATION: (362)..(362)
150 <223> OTHER INFORMATION: n is a, c, g, or t

152 <220> FEATURE:
153 <221> NAME/KEY: misc_feature
154 <222> LOCATION: (364)..(364)
155 <223> OTHER INFORMATION: n is a, c, g, or t

157 <220> FEATURE:
158 <221> NAME/KEY: misc_feature
159 <222> LOCATION: (377)..(377)
160 <223> OTHER INFORMATION: n is a, c, g, or t
163 <400> SEQUENCE: 11
W--> 164 agaaacttaa acagcggata aacttcagtg atttanatca gagaagtatt ggaagtgatt 60
165 ctcaaggtan agcaacagcg gctaacaaca aacgtcagct tagtgaaaac cgaaagccct 120
166 tcaacttttt gcctatgcag attaatacta acaagagcaa ggatgctact gcaagtcttc 180
167 caaagagaga gatgacaacg tcagcacagt gcaaagagtt gtttgcttct gctctaagta 240
168 atgacctttt gcaaaactgt caatctctga agaagatggg agaggggagc ctgcatggga 300
169 aacaccagat tgtaagcagg cttgttcaat cctgactata ttactaaagc tagttctatg 360
170 cnanaagttt tgtaaanaaa atgaaagtct gcaatgttga 400

172 <210> SEQ ID NO: 12
173 <211> LENGTH: 416
174 <212> TYPE: DNA
175 <213> ORGANISM: Mus musculus

177 <220> FEATURE:
178 <221> NAME/KEY: misc_feature
179 <222> LOCATION: (37)..(37)
180 <223> OTHER INFORMATION: n is a, c, g, or t

182 <220> FEATURE:
183 <221> NAME/KEY: misc_feature
184 <222> LOCATION: (363)..(363)
185 <223> OTHER INFORMATION: n is a, c, g, or t

187 <220> FEATURE:
188 <221> NAME/KEY: misc_feature
189 <222> LOCATION: (392)..(392)
190 <223> OTHER INFORMATION: n is a, c, g, or t

192 <220> FEATURE:

193 <221> NAME/KEY: misc_feature

194 <222> LOCATION: (401).. (401) 

195 <223> OTHER INFORMATION: n is a, c, g, or t 

197 <220> FEATURE: 

198 <221> NAME/KEY: misc_feature

199 <222> LOCATION: (403).. (403) 

200 <223> OTHER INFORMATION: n is a, c, g, or t 
203 <400> SEQUENCE: 12 

W--> 204 [illegible] ttgcagcata aagcagagca agctatnagc tgtgatggat gactctgttg 60

205 ttacagaaac tacaggaagc ttatctggag tcagcatcac atctgaacta aatgaagaac 120 

206 tgaatgattt aattcagcgt ttccataatc agcttcgtga ttctcagcct ccagctgttc 180 

207 cagacaacag aagacaggca gaaagtcttt cattaactag agagatttct cagagcagaa 240 

208 atccctcagt ttctgaacat ttacctgatg agaaagtaca gctttttagc aaaatgagag 300 

209 tactacagga aaagaacaag aaatggacaa attagttggg agaacttcat aaccttcgag 360 

210 atnagcatct gaacaactca tcatttgtgc cntcaacttc ncnccaaaga agtgaa 416

212 <210> SEQ ID NO: 13

213 <211> LENGTH: 484 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Mus musculus 

217 <220> FEATURE: 

218 <221> NAME/KEY: misc_feature

219 <222> LOCATION: (33).. (33) 

220 <223> OTHER INFORMATION: n is a, c, g, or t 

222 <220> FEATURE: 

223 <221> NAME/KEY: misc_feature

224 <222> LOCATION: (48).. (48) 

225 <223> OTHER INFORMATION: n is a, c, g, or t 

227 <220> FEATURE: 

228 <221> NAME/KEY: misc_feature

229 <222> LOCATION: (54).. (54) 

230 <223> OTHER INFORMATION: n is a, c, g, or t 

232 <220> FEATURE: 

233 <221> NAME/KEY: misc_feature

234 <222> LOCATION: (89).. (89) 

235 <223> OTHER INFORMATION: n is a, c, g, or t 

237 <220> FEATURE: 

238 <221> NAME/KEY: misc_feature

239 <222> LOCATION: (244).. (244) 

240 <223> OTHER INFORMATION: n is a, c, g, or t 

242 <220> FEATURE: 

243 <221> NAME/KEY: misc_feature

244 <222> LOCATION: (257).. (257) 

245 <223> OTHER INFORMATION: n is a, c, g, or t 
248 <400> SEQUENCE: 13

W--> 249 [illegible] ctactgaaca gcagcagcca ttnagctcaa aatccttnca gggnaaaaca 60

250 gagtatatgg cttttccaaa accctctgna aagcagttct tctcttggag cagaaaagca 120 

251 aaggaatcaa gaaacagccc gaagaggaag ctgaaaacac taagacacca tggttatatg 180 

252 atcaagaagg tggagtagaa aaaccatttt tcaagactgg atttacagag tctgtagaga 240 

253 aagntacaaa atagtanccg caaaaatcaa ccagatacaa gcaggagaag acgtcggttt 300 

254 gatgaagaat cccttggaaa gctttagcag tatgcctgat cctatagacc caacatcagt 360 

255 aactaaaaca tttaaaacaa gaaaagcatc tgcccaggcc agcctggcct ctaaggacaa 420 

256 aactcccaaa tcaaagagta agaagaggat tctactcagc tgaaaagtag agttaaaaat 480 

257 attg 

260 <210> SEQ ID NO: 14 

261 <211> LENGTH: 211 

262 <212> TYPE: DNA 

263 <213> ORGANISM: Mus musculus
265 <400> SEQUENCE: 14

266 ctgtctgtca ttgtcgttct cctttagaag gcagaaaaga aatgggaaga aaaaaggcaa 60

267 aatctggaac actataacgg aaaggagttc gagaagctcc tggaggaagc tcaggccaac 120

268 atcatgaagt caattccaaa cctggagatg cccccagctt ccagcccagt gtcaaaggga 180

269 gatgcggcag gggataagct ggagctgtca g 211

272 <210> SEQ ID NO: 15 

273 <211> LENGTH: 34 

274 <212> TYPE: DNA 

275 <213> ORGANISM: Artificial Sequence
277 <220> FEATURE:

278 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA 

280 <400> SEQUENCE: 15 

281 taccgttcgt atagcataca ttatacgaag ttat 34 

283 <210> SEQ ID NO: 16 

284 <211> LENGTH: 34 

285 <212> TYPE: DNA 

286 <213> ORGANISM: Artificial Sequence 

288 <220> FEATURE: 

289 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA 

291 <400> SEQUENCE: 16 

292 ataacttcgt atagcataca ttatacgaac ggta 34 

294 <210> SEQ ID NO: 17 

295 <211> LENGTH: 34 

296 <212> TYPE: DNA 

297 <213> ORGANISM: Artificial Sequence 

299 <220> FEATURE: 

300 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic DNA 

302 <400> SEQUENCE: 17

303 tattgaagca tatcgtatgt aatatgcttc aata 34 




RAW SEQUENCE LISTING ERROR SUMMARY DATE: 03/04/2004

PATENT APPLICATION: US/10/030,658A TIME: 09:44:18

Input Set : A:\2004-02-25 4456-0101P.ST25.txt
Output Set: N:\CRF4\03042004\J030658A.raw



Please Note: 



Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.



Seq#:11; N Pos. 36,70,362,364,377
Seq#:12; N Pos. 37,363,392,401,403
Seq#:13; N Pos. 33,48,54,89,244,257
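The "N Pos." values are 1-based residue positions. As a rough illustration of how such a list could be reproduced, the sketch below scans a block of sequence text and reports where "n" occurs. The function name `n_positions` and its `offset` parameter are hypothetical; the sample input is the final line of SEQ ID NO: 12 from this listing, which begins at residue 361.

```python
def n_positions(block, offset=0):
    """Return 1-based positions of 'n' in a block of nucleotide text.

    Spaces, digits and other non-letters are ignored, so a line copied
    from a listing (grouped in tens, with a trailing count) can be passed
    as-is. `offset` is the number of residues that precede the block.
    """
    residues = [c for c in block.lower() if c.isalpha()]
    return [offset + i for i, c in enumerate(residues, start=1) if c == "n"]


if __name__ == "__main__":
    last_line = "atnagcatct gaacaactca tcatttgtgc cntcaacttc ncnccaaaga agtgaa"
    print(n_positions(last_line, offset=360))
```

Each reported position should then have a matching <220> to <223> feature entry explaining what the n represents.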
VERIFICATION SUMMARY DATE: 03/04/2004

PATENT APPLICATION: US/10/030,658A TIME: 09:44:18

Input Set : A:\2004-02-25 4456-0101P.ST25.txt
Output Set: N:\CRF4\03042004\J030658A.raw

L:164 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11 after pos.:0
M:341 Repeated in SeqNo=11

L:204 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:12 after pos.:0
M:341 Repeated in SeqNo=12

L:249 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 after pos.:0
M:341 Repeated in SeqNo=13